Dynamic Unit Selection for Very Low Bit Rate Coding at 500 bits/sec
Abstract
This paper presents a new unit selection process for Very Low Bit Rate speech encoding at around 500 bits/sec. The encoding is based on speech recognition and speech synthesis technologies. The aim of this approach is to make the best use of the speaker's speech corpus. The proposed solution uses HMM modelling for the recognition of elementary speech units. The HMMs are first trained in an unsupervised phase and are then used to build the synthesis unit corpus. The coding process relies on the selection of synthesis units. The speech is decoded by concatenating the selected units through an HNM-like decomposition of speech. The new unit selection aims at finding the unit that best matches the prosody constraints in order to model the evolution of the prosody. It enables the size of the synthesis unit corpus to be independent of the targeted bit rate. A complete quantisation scheme for the overall set of encoded parameters is given.
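As a rough illustration of prosody-driven unit selection, the sketch below picks one candidate per segment from a hypothetical synthesis unit corpus by dynamic programming, minimising the sum of a prosody target cost and a concatenation cost. The prosody features (pitch, energy, duration), the cost weights and every name in the code are assumptions made for illustration; the abstract does not specify the actual costs or search used in the paper.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Unit:
    """Hypothetical prosody description of a unit (or of a target)."""
    pitch: float     # mean F0 in Hz
    energy: float    # mean energy in dB
    duration: float  # length in ms


def target_cost(unit, target):
    # Weighted absolute distance between prosody features.
    # The weights (1/10, 1, 1/10) are arbitrary assumptions.
    return (abs(unit.pitch - target.pitch) / 10.0
            + abs(unit.energy - target.energy)
            + abs(unit.duration - target.duration) / 10.0)


def concat_cost(prev, cur):
    # Penalise pitch jumps at the join between consecutive units.
    return abs(prev.pitch - cur.pitch) / 10.0


def select_units(candidates, targets):
    """Return one candidate index per segment with minimal total cost.

    candidates[i] lists the corpus units available for segment i
    (for example, all units of the recognised class), and targets[i]
    is the prosody measured on the input speech for that segment.
    """
    if not candidates or len(candidates) != len(targets):
        raise ValueError("need one non-empty candidate list per target")
    if any(len(c) == 0 for c in candidates):
        raise ValueError("every segment needs at least one candidate")

    # best[j] = (cost, path) of the cheapest path ending at candidate j.
    best = [(target_cost(u, targets[0]), [j])
            for j, u in enumerate(candidates[0])]
    for i in range(1, len(candidates)):
        new_best = []
        for j, unit in enumerate(candidates[i]):
            cost, path = min(
                ((c + concat_cost(candidates[i - 1][p[-1]], unit), p)
                 for c, p in best),
                key=lambda cp: cp[0],
            )
            new_best.append((cost + target_cost(unit, targets[i]), path + [j]))
        best = new_best
    return min(best, key=lambda cp: cp[0])[1]


if __name__ == "__main__":
    corpus = [
        [Unit(100, 60, 80), Unit(200, 60, 80)],
        [Unit(110, 60, 80), Unit(210, 60, 80)],
    ]
    measured = [Unit(100, 60, 80), Unit(110, 60, 80)]
    print(select_units(corpus, measured))
```

In a scheme of this kind the encoder would transmit the selected indices (or enough information for the decoder to repeat the selection), and the decoder would look the units up in its own copy of the corpus before concatenating them.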
Similar resources
Ultra low bit-rate speech coding based on unit-selection with joint spectral-residual quantization: no transmission of any residual information
A recent trend in ultra low bit-rate speech coding is based on segment quantization by unit-selection principle using large continuous codebooks as a unit database. We show that use of such large unit databases allows speech to be reconstructed at the decoder by using the best unit’s residual itself (in the unit database), thereby obviating the need to transmit any side information about the re...
An unified unit-selection framework for
We propose a unified framework for segment quantization of speech at ultra low bit-rates of 150 bits/sec based on unit-selection principle using a modified one-pass dynamic programming algorithm. The algorithm handles both fixed- and variable-length units in a unified manner, thereby providing a generalization over two existing unit selection methods, which deal with ‘single-frame’ and ‘segmental’...
Low Rate Speech Coding Using Contour Quantization
Vector quantization-based approaches to speech coding have generated new interest in very low bit rate speech coding, that is, speech coded to bit rates below 1200 bits/sec. To achieve such low bit rates, it is necessary to quantize the pitch and energy parameters at rates below 100 bits/sec. Contour quantization is introduced as a technique in which the contour of a given parameter is normaliz...
Optimum Drill Bit Selection by Using Bit Images and Mathematical Investigation
This study is designed to consider the two important yet often neglected factors, which are factory recommendation and bit features, in optimum bit selection. Image processing techniques have been used to consider the bit features. A mathematical equation, which is derived from a neural network model, is used for drill bit selection to obtain the bit’s maximum penetration rate that corresponds ...
Intra-frame and Inter-frame Coding of Speech LSF Parameters Using A Trellis Structure
Linear Predictive Coding (LPC) parameters are widely used in various speech processing applications for representation of the spectral envelope of speech. Low bit-rate speech coding applications require accurate quantization of these parameters using as few bits as possible. Line Spectral Frequency (LSF) representation is the most widely accepted representation of LPC parameters for quantizati...
Publication date: 2004